Machine translation using recursive chain-link-type learning based on translation examples

Authors

  • Hiroshi Echizen-ya
  • Kenji Araki
  • Yoshio Momouchi
  • Koji Tochinai
Abstract

Rule-based machine translation analyzes source-language sentences using large-scale linguistic knowledge that is given by the developer beforehand. However, it is difficult to give complete linguistic knowledge to the system ex ante because natural language has various linguistic phenomena. Therefore, we worked to develop learning-based machine translation. In learning-based machine translation, a system acquires translation rules automatically from translation examples that are pairs of source and target language sentences. However, existing learning-based machine translation presents the problem that it requires a large number of similar translation examples. Consequently, it cannot acquire enough useful translation rules from sparse translation examples. This paper proposes a method of machine translation using Recursive Chain-Link-type Learning, which can acquire many useful translation rules from sparse translation examples. Our system, based on this method, efficiently acquires translation rules from each translation example without requiring two similar translation examples. Translation rules are acquired by extracting corresponding parts between source and target language sentences in translation examples. Our system determines those corresponding parts using previously acquired translation rules. Therefore, the system engenders a chain reaction in acquisition of new translation rules. Evaluation experiments using our system demonstrated an effective translation rate of 61.1%. Moreover, the effective translation rate was 85.0% when sufficient learning data were given to our system. © 2004 Wiley Periodicals, Inc. Syst Comp Jpn, 35(2): 1–15, 2004; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.10511
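The chain reaction described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the rule format or the matching procedure, so the single-variable templates, the `@0` placeholder, the function names (`make_template`, `match_template`, `learn`), and the English/romanized-Japanese toy sentences are all assumptions made for illustration. The sketch shows only the general idea that a known fragment pair turns an example into a sentence template, and that template then extracts a new fragment pair from another example.

```python
from typing import List, Optional, Set, Tuple

Rule = Tuple[str, str]  # (source-side text, target-side text)
VAR = "@0"              # hypothetical placeholder for the variable slot


def find_sub(tokens: List[str], frag: List[str]) -> int:
    """Return the index of the first occurrence of frag in tokens, or -1."""
    n = len(frag)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == frag:
            return i
    return -1


def make_template(src: List[str], tgt: List[str],
                  p_src: List[str], p_tgt: List[str]) -> Optional[Rule]:
    """Replace a known fragment pair with VAR on both sides of an example."""
    i, j = find_sub(src, p_src), find_sub(tgt, p_tgt)
    # Skip if either side is missing or the fragment is the whole sentence.
    if i < 0 or j < 0 or len(p_src) >= len(src) or len(p_tgt) >= len(tgt):
        return None
    s = src[:i] + [VAR] + src[i + len(p_src):]
    t = tgt[:j] + [VAR] + tgt[j + len(p_tgt):]
    return " ".join(s), " ".join(t)


def match_template(tokens: List[str], template: List[str]) -> Optional[List[str]]:
    """Return the tokens that fill VAR if the sentence fits the template."""
    k = template.index(VAR)
    prefix, suffix = template[:k], template[k + 1:]
    if len(tokens) <= len(prefix) + len(suffix):
        return None
    if tokens[:len(prefix)] != prefix:
        return None
    if suffix and tokens[-len(suffix):] != suffix:
        return None
    return tokens[len(prefix):len(tokens) - len(suffix)]


def learn(examples: List[Rule], seed_parts: Set[Rule]) -> Tuple[Set[Rule], Set[Rule]]:
    """Alternate the two extraction steps until no new rule appears."""
    parts, templates = set(seed_parts), set()
    changed = True
    while changed:
        changed = False
        for src, tgt in examples:
            s, t = src.split(), tgt.split()
            # Step 1: known fragment pairs yield sentence templates.
            for ps, pt in list(parts):
                tmpl = make_template(s, t, ps.split(), pt.split())
                if tmpl and tmpl not in templates:
                    templates.add(tmpl)
                    changed = True
            # Step 2: known templates yield new fragment pairs.
            for ts, tt in list(templates):
                ms = match_template(s, ts.split())
                mt = match_template(t, tt.split())
                if ms and mt:
                    rule = (" ".join(ms), " ".join(mt))
                    if rule not in parts:
                        parts.add(rule)
                        changed = True
    return parts, templates


# Toy data (hypothetical, not taken from the paper).
EXAMPLES = [
    ("I like tennis", "watashi wa tenisu ga suki desu"),
    ("I like music", "watashi wa ongaku ga suki desu"),
    ("he plays music", "kare wa ongaku o ensou shimasu"),
    ("he plays piano", "kare wa piano o ensou shimasu"),
]

if __name__ == "__main__":
    parts, templates = learn(EXAMPLES, {("tennis", "tenisu")})
    print(sorted(parts))
    print(sorted(templates))
```

In this toy setup, a single seed pair is enough to start the chain: each newly acquired rule is reused to process the next example, so no two similar examples have to be compared directly. A real system would also need to handle multiple variables, ambiguous matches, and erroneous rules, none of which this sketch attempts.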

Related resources

Study of Practical Effectiveness for Machine Translation Using Recursive Chain-link-type Learning

A number of machine translation systems based on learning algorithms have been presented. These methods acquire translation rules from pairs of similar sentences in bilingual text corpora. This means that it is difficult for the systems to acquire the translation rules from sparse data. As a result, these methods require large amounts of training data in order to acquire high-quality translatio...

Using Reordering in Statistical Machine Translation based on Recursive Alignment Block Classification

Statistical Machine Translation (SMT) is based on alignment models which learn from bilingual corpora the word correspondences between the source and target language. These models are assumed to learn word reorderings. However, the difference in word order between two languages is one of the most important sources of errors in SMT. This paper proposes a Recursive Alignment Block Classification ...

Bilingually-Constrained Recursive Neural Networks with Syntactic Constraints for Hierarchical Translation Model

Hierarchical phrase-based translation models have advanced statistical machine translation (SMT). Because such models can improve leveraging of syntactic information, two types of methods (leveraging source parsing and leveraging shallow parsing) are applied to introduce syntactic constraints into translation models. In this paper, we propose a bilingually-constrained recursive neural network (...

A Hybrid Machine Translation System Based on a Monotone Decoder

In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...

The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language

Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...

Journal title:
  • Systems and Computers in Japan

Volume 35, Issue 2

Pages 1–15

Publication date: 2004